SVD based Data Transformation Methods for Privacy Preserving Clustering
نویسندگان
چکیده
Nowadays privacy issues are major concern for many government and other private organizations to delve important information from large repositories of data. Privacy preserving clustering which is one of the techniques emerged to addresses the problem of extracting useful clustering patterns from distorted data without accessing the original data directly. In this paper two hybrid data transformation methods are proposed for privacy preserving clustering in centralized database environment based on Singular Value Decomposition (SVD). In hybrid method one, SVD and rotation data perturbation are used as a combination to obtain the distorted dataset. In hybrid method two, SVD and independent component analysis are used as a combination to obtain the distorted dataset. In SVD the data is analyzed in different perspectives to retain important information. Higher order statistics which contains more important information is utilized in independent component analysis. Experimental results demonstrate that the proposed methods are efficiently protects the private data of individuals and retains the important information for clustering analysis.
منابع مشابه
Privacy Preserving Clustering on Distorted data
In designing various security and privacy related data mining applications, privacy preserving has become a major concern. Protecting sensitive or confidential information in data mining is an important long term goal. An increased data disclosure risks may encounter when it is released. Various data distortion techniques are widely used to protect sensitive data; these approaches protect data ...
متن کاملCLUST-SVD: Privacy preserving clustering in singular value decomposition
Large repositories of data contain sensitive information that must be protected against unauthorized access. The protection of the confidentiality of this information has been a long-term goal for the database security research community and for the government statistical agencies. Recent advances in data mining and machine learning algorithms have increased the disclosure risks that one may en...
متن کاملA Privacy-Preserving Data Mining Method Based on Singular Value Decomposition and Independent Component Analysis
Privacy protection is indispensable in data mining, and many privacy-preserving data mining (PPDM) methods have been proposed. One such method is based on singular value decomposition (SVD), which uses SVD to find unimportant information for data mining and removes it to protect privacy. Independent component analysis (ICA) is another data analysis method. If both SVD and ICA are used, unimport...
متن کاملRevisiting "Privacy Preserving Clustering by Data Transformation"
Preserving the privacy of individuals when data are shared for clustering is a complex problem. The challenge is how to protect the underlying data values subjected to clustering without jeopardizing the similarity between objects under analysis. In this short paper, we revisit a family of geometric data transformation methods (GDTMs) that distort numerical attributes by translations, scalings,...
متن کاملA Privacy-Preserving Classification Method Based on Singular Value Decomposition
With the development of data mining technologies, privacy protection has become a challenge for data mining applications in many fields. To solve this problem, many privacy-preserving data mining methods have been proposed. One important type of such methods is based on Singular Value Decomposition (SVD). The SVD-based method provides perturbed data instead of original data, and users extract o...
متن کامل